Im2Flow: Motion Hallucination from Static Images for Action Recognition

نویسندگان

  • Ruohan Gao
  • Bo Xiong
  • Kristen Grauman
چکیده

Existing methods to recognize actions in static images take the images at their face value, learning the appearances—objects, scenes, and body poses—that distinguish each action class. However, such models are deprived of the rich dynamic structure and motions that also define human activity. We propose an approach that hallucinates the unobserved future motion implied by a single snapshot to help static-image action recognition. The key idea is to learn a prior over short-term dynamics from thousands of unlabeled videos, infer the anticipated optical flow on novel static images, and then train discriminative models that exploit both streams of information. Our main contributions are twofold. First, we devise an encoder-decoder convolutional neural network and a novel optical flow encoding that can translate a static image into an accurate flow map. Second, we show the power of hallucinated flow for recognition, successfully transferring the learned motion into a standard two-stream network for activity recognition. On seven datasets, we demonstrate the power of the approach. It not only achieves state-of-the-art accuracy for dense optical flow prediction, but also consistently enhances recognition of actions and dynamic scenes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Research on Far-Field Face Detection for Recognition

Far-field face detection is the first step of automatic face recognition in video surveillance. In this paper, we propose a framework on extraction of high-resolution images in video that subjects are far from camera. To guarantee the tradeoff between accuracy and speed, our method uses four techniques including motion detection using Gaussian Mixture Models (GMMs), skin model detection and Ada...

متن کامل

Deep Joint Face Hallucination and Recognition

Deep models have achieved impressive performance for face hallucination tasks. However, we observe that directly feeding the hallucinated facial images into recognition models can even degrade the recognition performance despite the much better visualization quality. In this paper, we address this problem by jointly learning a deep model for two tasks, i.e. face hallucination and recognition. I...

متن کامل

Learning and Visual Hallucination

It is generally recognized that learning is a useful tool for solving vision problems. However, at present time, neither learning nor vision is a well understood problem. It is even less clear how one might be able to put the two together. In this paper I will attempt to study one small aspect of this problem of using learning in vision. In particular I will study how learning might be used for...

متن کامل

Hallucinating multiple occluded face images of different resolutions

Learning-based super-resolution has recently been proposed for enhancing human face images, known as ‘‘face hallucination’’. In this paper, we propose a novel algorithm to super-resolve face images given multiple partially occluded inputs at different lower resolutions. By integrating hierarchical patch-wise alignment and inter-frame constraints into a Bayesian framework, we can probabilistical...

متن کامل

Face hallucination based on morphological component analysis

In this paper, we formulate the face hallucination as an image decomposition problem, and propose a Morphological Component Analysis (MCA) based method for hallucinating a single face image. A novel three-step framework is presented for the proposed method. Firstly, a low-resolution input image is up-sampled via an interpolation. Then, the interpolated image is decomposed into a global high-res...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1712.04109  شماره 

صفحات  -

تاریخ انتشار 2017